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Abstract 

We consider an evolving network of a fixed number of nodes. The allocation 
of edges is a dynamical stochastic process inspired by biological reproduction dy- 
namics, namely by deleting and duplicating existing nodes and their edges. The 
properties of the degree distribution in the stationary state is analysed by use of 
the Fokker-Planck equation. For a broad range of parameters exponential degree 
distributions are observed. The mechanism responsible for this behaviour is illumi- 
nated by use of a simple mean field equation and reproduced by the Fokker-Planck 
equation treating the degree-degree correlations approximately. In the limit of zero 
mutations the degree distribution becomes a power law. 
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1 Introduction: Networks and evolutionary dynam- 
ics 

Whenever a phenomena can be thought of in terms of components and relations between 
components the mathematical language of graph theory or networks may be helpful to 
the description, analysis and the understanding of the relevant problem of interest. A 
large amount of work is currently being done with the aim to understand the structure 
and statistical properties of networks in the hope that certain aspects of the general 
mathematical characterisation of network structure may be related to common functional 
properties, e.g. vulnerability to breakdown of part of the network [H [21 E]- 
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Our aim in the present paper is to discuss an example of a network, inspired by evo- 
lutionary dynamics where we can relate the emergent network structure to microscopic 
details of the reproductive dynamics generating the network. We think of the network as 
arising from the dynamics of reproducing individuals linked together with other individu- 
als in ways that depend on their type. The nodes represents the species emerging in some 
type space as a result of the mutation prone reproductive dynamics at the level of individ- 
uals. As individuals of various types interact and reproduce they segregate in type space 
allowing local regions of high occupancy to be identified as species. In this way a network 
structure emerges on the space of types. Nodes consists of occupied, or extant, species, 
and edges connecting the these nodes represent the interactions between the species. The 
basic dynamics is individually driven and consists of individuals being born for then to 
disappear again somewhat later at random killing event (driven as a Poisson process). 
At the level of individuals the dynamics doesn't appear very exciting. However at the 
systems level, the dynamics of the occupancy density in type space exhibit interesting 
emerging structures. We will focus on the functional form of the degree distribution and 
its relation to the amount of mutation involved in the underlying reproductive dynamics. 

Stumpf and co-workers have studied the properties of sub-networks obtained by ran- 
dom sampling nodes in a larger network [1]. They showed that only binomial (or Poisson) 
networks are invariant under decimation. If the large network has a binomial degree dis- 
tribution a randomly sampled subset of the network will also exhibit a binomial degree 
distribution. Here we are concerned with evolutionary dynamics on very large and corre- 
lated networks. The dynamics generates sub-networks typically with exponential degree 
distributions even when the full network has a binomial degree distribution. 

The paper is organised as follows. To set the scene we first briefly describe the 
individual based Tangled Nature model of evolutionary ecology and review a few aspects 
of the phenomenology of the networks emerging at the level of types Next we use 

a simplified model defined at the level of types - or species - to gain some understanding of 
how evolutionary dynamics on correlated networks can give rise to exponential networks 
and, in certain limits, to a power law degree distribution. 

2 Tangled Nature model 

The basic of the Tangled Nature model [HI El [7] is as follows. Individuals, {a,/3, ...} are 
described by type vectors T° = ( Tf, , T£). The number of individuals of type T at 
time t is denoted by n{T,t. Different types influence each other through an interaction 
matrix (J-matrix) that accounts for all possible interactions between any possible set 
of types. Selection leads to only a small fraction of types being occupied and their 
interactions will be described by a small subset of the elements of this complete matrix. 
The structure of the interaction network between extant types is found to depend on the 
statistical properties of J. A proportion, 6, of the elements of the J-matrix, J(T", T^) are 
assigned non-zero (an non symmetric) values all other elements are zero. The interactions 
assigned in the type space can either be uncorrelated [HI [5] , or correlated [HI [7] . If no 
correlations are present in the type space the evolved networks of interactions between 
extant species exhibit a binomial degree distribution as does the underlying network of 
non- zero J-matrix elements [3]. The correlated case is more interesting and the one we 
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will concentrate on in this paper. Correlations are made to decay exponentially with 
separation in type space. This implies that off-spring will see a set of interactions which 
are very similar to the interactions of the parent even when mutations make the off-spring 
differ from the parent. When correlations are present in J the evolutionary dynamics 
generates networks with exponential degree distributions in contrast to the binomial 
distributions exhibited by a network constructed by randomly seleceting positions in 
type space. 

The dynamics of the system consists of the following time step. First randomly choose 
an individual and attempt to remove the individual with constant probability pkui that is 
independent of type and time. Next randomly choose an individual and replicated it with 
a probability that depends on the type of the individual and the configuration in type 
space. In the Tangled Nature model a probabilistic weight is calculated that depends 
on how the type of the chosen individual interact with other extant types. This value is 
determined through the use of a weight function, 

"^-'^'> - - EZ^'^m - E C,T^T)„,T,*) - (1) 

This is monotonically mapped to the interval [0,1], appropriate for a probability measure, 
by using the following function, 

_ exp[//(T°,t)] 

The sums of Eq.(lT]) are made over the points in type space, T, and the occupancies, 
n(T, t) are used to account for the multiplicity of individuals with the same type vector. 
We imagine a well mixed system and otherwise neglect spatial aspects. We stress that 
the type space is a pre-defined, complete set of all possible types and it is evolution 
and contingency that select the actualised types in the evolved system. The J-matrix is 
similarly a pre-defined complete set of all possible interactions for all possible types that 
may exist in potentia. The elements of the matrix are exponentially correlated in the 
following way [6] 

C(J(T", T^), J(T^ T^)) = exp[-A(T", T^)/^] G [0, 1), (3) 

Here, ^ is the correlation length and C(J(T", T'^), J(T^, T'*')) is the correlation between 
the interaction strengths of two types Ta and TjS when each are interacting with a third, 
T7. 

The evolutionary dynamics generates a set of occupied types linked together according 
to the interaction matrix J. We are here concerned with the properties of the degree dis- 
tribution of this interaction network. Simulations found that the degree distribution P{k) 
is of exponential form P{k) oc exp(— A;/A;o) whenever the interaction matrix is correlated 



3 Simple node model 

To understand this phenomenology we now neglect the fluctuations present at the level 
of individual based dynamics and assume a more coarse grained view point in which we 
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consider types as either occupied or not. I.e. we turn the measure n{T,t) into a binary 
equal to 1 when n(T, t) > and zero when n{T, t) = 0. We consider the dynamics at the 
level of types which implies that creation events correspond to one type splitting into two 
types (a speciation event) and annihilation events correspond to a type going extinct. 
We elevate the dynamics of the individual based Tangled Nature model to the level of 
types in the following way. 

For simplicity we consider a fixed number D (for diversity) of types. A timestep con- 
sists of choosing a type, or node, at random and remove it together with all its associated 
edges. Next another node, a parent type, is randomly selected from the remaining D—1 
nodes and is duplicated in the form of a daughter. All types connected to the parent 
are now given connections to the daughter with probability Pg. All types unconnected to 
the parent are given connections to the daughter with probability, P„. An edge between 
the daughter and parent is placed with probability PjE- These probabilities represent the 
similarity or correlations between offspring and parent induced in the Tangled Nature 
model by the correlations in the interaction matrix J. 

This procedure is simple to simulate and produces, independent of initial configura- 
tion, after a relatively short transient, networks with degree distributions that behave 
exponentially to a very good approximation except in the limit Pe 1 and vanishing 
Pn and Pp. We will in a moment write down the complete Fokker-Planck equation for 
the degree distribution of a network evolving according to this process. The full equation 
is, however, rather involved and can only be solved by numerical iteration. It is there- 
fore illuminating to make the following simplistic and heuristic considerations. Let nk{t) 
denote the number of nodes of degree k after t timesteps. Let us focus solely on the 
following three aspects of the dynamics: 1) a node of degree k is selected for annihilation, 
this occurs with probability nk{t)/D. 2) a parent node of degree k receives an edge to the 
daughter, this occurs with probability PpHkit)/ D. 3) A node receives an edge because it 
is selected to become a neighbour of the daughter (of a degree kp parent) with probability 

(Pe + Pn)[kp/D][nu^{t)/D] ^ (Pe + PnW) / D]. (4) 

As we are seeking a qualitative mean field equation, we substituted the average degree 
{k) in the last expression. Note we have deliberately neglected the direct effect of the 
daughter. The rationale behind this is that there is only one daughter node, but the 
daughter has typically several neighbour nodes. We combine these events to obtain the 
following iterative equation. 

n,{t + 1) = n,(t) - !^ + Pv + {Pe + Pnm ^^^ ^^^^^^^ 

The stationary solution to this equation is readily obtained self-consistently to be 

n(A;) = n(0)exp[-A;/A;o], (6) 

with 

ko = 1/ ln(l + [Pp + (Pe + P„)(fc)]-'), (7) 

and 

<*) = ^(;|)-'*- 

*For a similar model with P„ = see [TU] 
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The dependence on Pg of ko in this solution is in quahtative agreement with the change 
in the exponential part of the degree distribution obtained in simulations of the network, 
see Fig. [H However, this simplistic mean field discussion is only of heuristic value. We 
now present the full Fokker-Planck equation for the process. 
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Figure 1: Simulated degree distributions of the node model for D = 200 and Pp = 0.01 
using the imperfect duplication process. From short to long tail we have Pe = 0.01, 0.25, 
0.75, 0.95, 0.99, 0.999 and P„ was chosen to be P„ = Pp(l - Pe)/(1 - Pp) in order to keep 
the connectance fixed (Laird & Jensen, 2007). 



4 Fokker-Planck equation 

Let us now develop the Fokker-Planck equation for the ensemble averaged time dependent 
number of nodes of degree k, nk{t), constrained by the condition, '^f.nk{t) = D. We 
structure the analysis in the following way. Removal (R): The effect of removing, from a 
population of D, a node and its edges described by a rate term TptiD, k,t). Duplication 
(D): The effect of introducing, into a population of D — 1, a new daughter node and 
attaching edges described by a rate term T£)u{D — 1, k, t). Our equation has accordingly 
the form. 

Tikit + 1) = nfe(t) + TniD, k, t) + TDuiD - 1, fc, t). (9) 

Removal of a node affects the network in two ways: (F^) the node removed from the 
network and (F^) the effect on the nodes adjacent, i.e. sharing edges, with the node 
being removed. Therefore 

Tnik) = -r^{k) + r%{k+i) - rj,{k). (lo) 

The effect of the duplication process is conveniently broken up into three sub-effects: 
(X^Du) effect on the parent, (F^^) the effect on the daughter, (F^^) the effect on the 
adjacent nodes, i.e. those that will receive an extra edge as a result of the duplication. 
Hence 

Tnuik) = T^^^ik-1) - FL(A:) + VUk) + T^^ik-l) - F^„(A:). (11) 
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We have suppressed the timestep, t, and network size, D, for notational ease. 

Next we derive detailed expressions for each of the terms above. The direct effect on 
Uk of removing a node of degree, k is to decrement its value. The probability of selecting 
a node of degree k is Uk/N, and therefore, 

^R{k) = f . (12) 

After the removal, the degree of the nodes connected to the removed node, i.e. the 
adjacent nodes, will decrease by one. For this we need the Edge probability, 

PEdikii /c2, q) — Prob{node of degree ki is connected to q nodes of degree k^}- (13) 

In general we do not have an analytic expression for PEd{ki,k2,q), but below we give 
approximate forms neglecting, or treating non-rigousrly, degree-degree correlations. Here 
we note 

r^(^) = Ei^E^^^<^(^-^'^)- (14) 

kr=i g=i 

The first sum is over the degree of the removed node, the second sum is over the number, 
q, of nodes of degree k = 0,1, N—1 the removed node is connected to. 

A node of degree k is selected for duplication with probability nk/{D — 1). The 
daughter of this node receives an edge to the parent with probability Pp. Thus the parent 
increases its degree by one with probability 

^Uk) = P,^- (15) 

The new daughter node can add to Uk by an amount determined by the probability of 
finishing with k edges, 

r^^ik) = PpA{k - 1) + (1 - Pp)m- (16) 

To keep track of the bookkeeping we have introduced a new probability 

A{k) — Prob{daughter receives k edges to nodes different from the parent}, (17) 
which is given by 

Z)—2 min[fcp,fc] min[D—2—kp,k] 

m-Yl E E ^^s{k,+k2-k)n{k,,k2,kp), (18) 

kp=0 ki=0 k2=0 

The right hand side adds up the probabilities associated with the process where the 
daughter inherits ki edges to nodes already connected to the parent. Each of these edges 
is inherited by the daughter with probability, Pg- The daughter may receive an additional 
k2 — k — ki edges to nodes not connected to the parent. Each of these edges are attached 
to the daughter with probability Pn- The factor Q{ki,k2,kp) denotes the probability that 
ki edges are inherited among the kp nodes connected to the parent and k2 edges connect 
to nodes not connected to the parent. 
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Next we consider the effect of the duphcation on the adjacent nodes and we need to 
distinguish between nodes sharing an edge with the parent (Ed) and nodes not connect to 
the parent {nEd). Let us first consider the Ed nodes. We introduced above PEd{kp, k, qe) 
as the probabihty that a mode, here the parent, of degree kp is connected to qE nodes 
of degree k. The duphcation process will, with probabihty P^, attach a new edge from 
the daughter to each of these nodes and thereby increase their degree from k to k + 1. 
Let us now turn to the nEd nodes. There are D — 2 — kp nodes which share no edge 
with the parent. With probabihty PnPnEd{D — 2 — kp, k, qnE) a total of qnE of these 
nodes are of degree k and will receive a new edged to the daughter. Here PnEd{.D — 2 — 
kp, k, qnE) is equivalent to PsdiD, k, q) introduced in Eq. (|T^ . though PnEdikp, k, qnE) is 
concerned with the D — 2 — kp nodes that a node of degree kp (in a set of D — 1 nodes) 
is not connected to. Among these D — 2 — kp nodes qnE have degree k with probability 
PnEd{D — 2 — kp, k, qnE)- Therefore we have, 

ru*) = EEE E EE^(4.+*-«) 

q=0 kp=0 Ki=0 K2=0 gi=0(ji2=0 

q^PEdikp,k,K,) ( 2 ) P^'^-Pe)^'-'' 
PnEd{kp,k,K2) ( 2 ) Pra-Pn)""''''. 

(20) 

Degree-degree correlations induced by the evolutionary dynamics makes is difficult 
to write an explicit form for PEdiki,k2,q) and PnEdiki,k2,q). One can neglect these 
correlations altogether and try to estimate PEd and PnEd by purely binomial arguments 
in the following way. First we deal with PEd{ki, k2, q). The ki edges emerging from the 
degree fci node connects (in this approximation) to nodes of degree ^2 with probability 
(nfcj — 5^1,^2)/ (-D — 1) [remember there are D — 1 nodes when the duplication takes place] 
hence we use 

PEdiku h,q)=^^ ) ) [I - ) ■ (21) 

When we treat PnEd{ki,k2,q) in the same approximation we obtain PnEd{ki,k2,q) = 
PEd{D — 2 — ki, k2, q) since we now pick q nodes among the D — 2 — ki nodes not connected 
to the degree ki node under consideration. One seems, however, to obtain solutions that 
numerically converge better towards the results obtained by direct simulation (See Fig. 
[U and Fig. [2]) if one tries to account for the correlations in the allocations of the edges by 
estimating Ped{ki, ^2, q) using the following urn argument. We place M = Uk edges in 
an urn. The edges are of two types. Type A edges correspond to the |^| = fc2(^fe2 —(^^1,^2) 
edges connecting to nodes of degree k2- In addition we have \B\ = M — \A\ type B edges 
connecting nodes of degree different from ^2- The probability that among ki randomly 
picked edges q are of type A and ki — q are of type B is given by 

PEd{k„ k2,q)={^^ ) (^) (1 - -j^) . (22) 
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Again we assume PnEd{ki, k2, q) = PEd{D — 2 — ki, k2, q). In general it is not simple to find 
analytic solutions to this somewhat involved set of equations. The result of iterating the 
Fokker-Planck Eq. ([9]) using these estimates is shown in Fig. [2]for diversity D = 20, which 
makes the numerical iteration manageable. We notice good qualitative agreement with 
the behaviour of simulation results presented Fig. [TJ For a broad range of values of the 
parameters the degree distributions exhibit much of the same approximate exponential 
form as produced by the individual based Tangled Nature model described in the Sec. O 




Figure 2: The degree distribution obtained by iteration of the Fokker-Planck equation 
©. The exponential form is visible for a broad range of parameter values in the linear- log 
plot to the left. The approach towards a 1/k dependence in the limit of Pg ^ 1 can be 
seen in the log-log plot to the right. The two straight lines have slope -1. The parameters 
are D = 20, Pe = 0.01, 0.1, 0.3, 0.5, 0.7, 0.9, 0.95 and Pp = 0.01. P„ was chosen to be 
P„ = Pp(l - P,)/(l - Pp) ) 

Let us finally mention that direct simulations [H] of the simple model introduced in 
section [3] show that in the limit Pg ^ 1, P„ — and Pp <^ 1 the degree distribution rik 
behaves like 1/k. The Fokker-Planck equation Eq. confirms this result. In the limit 
Pg = 1 and P„ — s> (i.e. perfect replication) the Fokker-Planck equation reduces to 

D-2 ki 

+ ^ ^qnki[^PEdiki,k + l,q) - ^ _ ^ PEd{kuk,q) 

ki = l q=l 

-^PEdiki, k, q) + -^^P^,(fci, - 1, q)]. (23) 
Including only the leading terms from ki = 1 and g = 1 one obtains 

Ukit + 1) = nu -\ ; tH \nk--\ — Ukl 

1 1 

+ - kuk} - jj^ikrik -{k- l)n,,_i}]. (24) 

In the limit D ^ 1 and Pp -C 1 this equation has the stationary solution Uk oc 1/k. 
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5 Summary and Discussion 



We have considered networks, consisting of a constant mimber of nodes, with the alloca- 
tion of edges evolving according to dynamical rules inspired by biological reproduction. 
We discussed mathematically the mechanisms behind the exponential form of the degree 
distribution found typically except in the limit where the reproduction doesn't involve 
any mutations at all. The Fokker-Planck equation we derived allows us to verify that 
the degree distribution changes from exponential to a power law in the limit where the 
replication is of high fidelity. 

The generality of the dynamics may make the results relevant to a range of problems 
including protein-protein interacting networks and routers dynamics in communication 
networks. 
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